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Abstract 

Translation is a central biological process by which proteins are synthesized from genetic information contained within mRNAs. 
Here we study the kinetics of translation at molecular level through a stochastic simulation model. The model explicitly include 
RNA sequences, ribosome dynamics, tRNA pool and biochemical reactions in the translation elongation. The results show that 
the translation efficiency is mainly limited by the available ribosome number, translation initiation and the translation elongation 
time. The elongation time is log-normal distribution with mean and variance determined by both the codon saturation and the 
process of aa-tRNA selection at each codon binding. Moreover, our simulations show that the translation accuracy exponentially 
decreases with the sequence length. These results suggest that aa-tRNA competition is crucial for both translation elongation, 
translation efficiency and the accuracy, which in turn determined the effective protein production rate of correct proteins. Our 
results improve the dynamical equation of protein production with a delay differential equation which is dependent on sequence 
informations through both the effective production rate and the distribution of elongation time. 
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1. Introduction 

Translation is a central biological process by which genetic 
information contained within mRNAs is interpreted to generate 
proteins. Ribosomes provide the environment for all activities 
of the translation process, such as the formation of the initia¬ 
tion complex, the elongation of the translation process by which 
the ribosome moves along the mRNA sequence, and the disso¬ 
ciation of the ribosome from the mRNA. Protein synthesis is 
principally regulated at the initiation stage and hence the pro¬ 
tein production rate is mainly limited by the availability of free 
ribosomes 112 ED. During translations, the ribosome selects 
matching aminoacylated tRNA (aa-tRNA) to the mRNA codon 
from a bulk of non-matching tRNAs, and the reaction rate con¬ 
stants can show 350-fold difference in the stability of cognate 
and near-cognate codon-anticodon complexes 0. Hence, in 
addition to the initiation stage, the translation efficiency is also 
affected by the mRNA sequence and the competition between 
cognate and near-cognate tRNAs mmm. Global under¬ 
standing of how ribosome number, mRNA sequence, and tRNA 
pool combine to control the translation kinetics has been an in¬ 
teresting topic in recent years for its potential impacts on the 
biogenesis and synthesis biology El H3 El Eol. 

Computational models have been developed to investigate 
details of the translation kinetics and to explore the main fac¬ 
tors that affect the translation efficiency, such as codon bias, 
tRNA and ribosome competition, ribosome queuing, codon or¬ 
der m (2 Q 031 OS (SB .211 • In these models, status of all 
ribosomes and tRNAs along a mRNA are tracked in continu¬ 
ous time. Translation initiation and the availability of free ri¬ 
bosomes were highlighted in previous studies Il3l l20l |2D . In 
eh, it was found that the varying of translation efficiency were 


caused by very short times of translation initiation. Through a 
model that tracks all ribosomes, tRNAs and mRNAs in a cell, 
it was concluded that the protein production in healthy yeast 
cells was typically limited by the availability of free ribosomes, 
however the protein production under stress was rescued by re¬ 
ducing the initiation or elongation rates EQ] . Codon bias of a 
mRNA sequence is an important factor that may affect the the 
translation efficiency due to competitions of tRNAs EH2EI. a 
study of S. cerevisiae genome suggest that tRNA diffusion away 
from the ribosome is slower than translation, and hence codon 
correlation in a sequence can accelerate translation because the 
same tRNA can be used by nearby codons (2). In the elon¬ 
gation process, a cognate, near-cognate, or non-cognate tRNA 
may attempt to bind to the A site of a ribosome. A study based 
on a computation model with detailed tRNA pool composition 
shows that the competition between near-cognate and cognate 
tRNAs is a key factor that determines the translation rate 
Another study by a mean-field model of translation in S. cere¬ 
visiae shows that the competition for ribosomes, rather than 
tRNAs, limits global translation El. Ribosome collisions can 
also reduce the translation efficiency according to a model of 
stochastic translation process for E. coli lacZ mRNA as a traffic 
problem 031 . From the point of view of evolution, the mech¬ 
anism for controlling the efficiency of protein translation was 
evolutionarily conserved according to a calculation on adapta¬ 
tion between coding sequences and the tRNA pool |[23l . More¬ 
over, using a nested model of protein translation and population 
genetics to the genomic of S. cerevisiae, it was suggested that 
the codon usage bias of genes can be explained by the evolu¬ 
tion through the selections for efficient ribosomal usage, ge¬ 
netic drift, and biased mutation, and the selection for efficient 
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ribosome usage is a central force in shaping codon usage at the 
genomic scale tm 

Despite extensive studies, much details of how translation is 
controlled by mRNA sequences and cellular environment re¬ 
main exclusive. Both the number of available free ribosomes 
and the codon orders are important for translation efficiency, 
however how various factors combine to determine the transla¬ 
tion efficiency is not clearly formulated. Since a codon is bound 
by a near-cognate tRNA, proteins with mismatched amino acids 
can be produced in translations. Hence, the translation accuracy 
may be dependent on the codon usage of a sequence and the 
composition of tRNAs, but little result about the dependence 
is known to the best of our knowledge. The relation of how 
the timing of ribosome elongation stage depends on a sequence 
and the tRNA pool is closely related to the modeling of genetic 
network dynamics in which the elongation time associates with 
the time-delay in dynamical equations l22l|26l[27l . but how the 
elongation time is formulated remains mystery. 

In this paper the translation kinetics is considered through a 
stochastic computation model with detailed reactions of the ri¬ 
bosome dynamics. In our study, several factors including the 
coding sequence, ribosomes, and the composition of tRNA tool 
were modeled to investigate how the translation efficiency, ac¬ 
curacy, and elongation time are determined. Moreover, transla¬ 
tion dynamics of various mRNA sequences (yeast and human, 
coding and non-coding mRNAs) were studied to try to clarify 
whether or not the sequence is important for the translation ef¬ 
ficiency and the translation accuracy. Our results show that the 
translation efficiency is mainly limited by the number of the 
available ribosomes, translation initiation and the elongation 
time of translation, and the elongation time is log-normal distri¬ 
bution with mean and variance of the logarithm of the elonga¬ 
tion time dependent on the sequence through aa-tRNA usages. 
Moreover, the translation accuracy exponentially decrease with 
the sequence length. These results provide more detailed un¬ 
derstanding of the translation processes, and can improve the 
mathematical modeling of protein production in gene regula¬ 
tion network dynamics. 

2. Model and methods 

2.1. Model description 

Fig. □ illustrates our model of ribosome kinetics in trans¬ 
lation presented in m We summarize the model description 
below and refer |6| for details. 

Translation of a protein begins from the initiation stage by 
which the start codon (AUG site) of the mRNA sequence is oc¬ 
cupied by a ribosome, and the peptide between the first two 
amino acids are formed, with corresponding aa-tRNAs bind¬ 
ing to the E and P sites of the ribosome, respectively. Dur¬ 
ing the elongation, each move of the ribosome includes 9 steps 
as shown by Fig. [Tj initial binding of aa-tRNA, codon recog¬ 
nition, GTPase activation, GTP hydrolysis, EF-Tu conforma¬ 
tion change, rejection, accommodation, peptidyl transfer, and 


Sec http://v.youku. com/v_show/id_XNzMxNzEwNjgO .html for an 
animation of translation. Kindly provided by Prof. Ada Yonath. 
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Figure 1: Kinetic scheme of RNA translation. Re-drew from l6l . 


Table 1: 

Values of kinetic rate constants (s *) (refer to 0) 

Parameters 

Values 

Cognate 

Near-cognate 

Non-cognate 

K 

0.03 

- 

- 

- 

kl 

- 

140 

140 

2000 

kOl 

- 

85 

85 

- 

k2 

- 

190 

190 

- 

k02 

- 

0.23 

80 

- 

k3 

- 

260 

0.4 

- 

kG 

- 

1000 

1000 

- 

k4 

- 

1000 

1000 

- 

k5 

- 

1000 

60 

- 

k7 

- 

60 

1000 

- 

kp 

- 

200 

200 

- 

kT 

- 

20 

20 

- 


translocation. For each codon on the mRNA sequence, tRNAs 
in the tRNA pool are divided into three types: cognate, near¬ 
cognate, and non-cognate, as listed in (6j. All aa-tRNAs can 
attempt to bind to the A site of the ribosome according to the 
match between codon and anticodon E), however only cognate 
and near-cognate aa-tRNAs can go through the step of peptide 
formation, non-cognate aa-tRNAs are rejected by codon recog¬ 
nition. Cognate aa-tRNAs give the correct amino acid follow¬ 
ing the genetic code, while near-cognate tRNAs often bring in¬ 
correct amino acids and yield a defect protein. Reactions rates 
are different for cognate and near-cognate tRNAs, which have 
been reported at mm and are given by Table [T] in our simula¬ 
tions. We note that near-cognate aa-tRNAs are more likely to be 
rejected at both steps of codon recognition and rejection. There¬ 
with, the competition between cognate and near-cognate tRNAs 
may be crucial for both the fidelity of peptide synthesis and 
translation efficiency mm. After peptidyl transfer, the E site 
aa-tRNA is released and the ribosome move forward a codon 
with the A site free waiting for the next move. Translation of 
a polypeptide stops when the ribosome reaches a stop codon 
(UAG/UAA/UGA), where the polypeptide is released and the 
ribosome drops off from the mRNA. One ribosome can synthe¬ 
size only one polypeptide at a time, and each mRNA can be 
translated simultaneously by multiple ribosomes. The multiple 
ribosomes forms a queue along the mRNA, with a safe distance 
of at least 10 codons between two ribosomes 03 US- 
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Table 2: tRNA pool composition (refer to (HIS))' Also refer 0 for the anti- 
codons for the tRNAs._ 


tRNA 

Molecules/cell 

tRNA 

Molecules/cell 

tRNA 

Molecules/cell 

Alai 

3250 

His 

639 

Pro3 

581 

Ala2 

617 

Ilel 

1737 

Sec 

219 

Arg2 

4752 

Ile2 

1737 

Seri 

1296 

Arg3 

639 

Leul 

4470 

Ser2 

344 

Arg4 

867 

Leu2 

943 

Ser3 

1408 

Arg5 

420 

Leu 3 

666 

Ser5 

764 

Asn 

1193 

Leu4 

1913 

Thrl 

104 

Aspl 

2396 

Leu5 

1031 

Thr2 

541 

Cys 

1587 

Lys 

1924 

Thr3 

1095 

Glnl 

764 

Met fl 

1211 

Thr4 

916 

Gln2 

881 

Met f2 

715 

Trp 

943 

Glu2 

4717 

Met m 

706 

iyrl 

769 

Glyl 

1068 

Phe 

1037 

Tyr2 

1261 

Gly2 

1068 

Prol 

900 

Vail 

3840 

Gly3 

4359 

Pro2 

720 

Val2A 

630 





Val2B 

635 




Figure 2: Translation kinetics of a single mRNA sequence, (a) Positions of 
each ribosome on the sequence, (b) Numbers of protein products. Black solid 
line for all protein products, red dashed line to correctly translated proteins 
(no incorrect amino acid added by near-cognate aa-tRNAs). Here the sample 
sequence is the gene YAL003W from the SGD yeast coding sequences, with 
sequence length L = 621«f. Parameters are R = 20, F = 0.03 and other 
parameters refer Table ^ 


2.2. Numerical scheme 

The translation process with multiple ribosomes was mod¬ 
eled with the stochastic simulation algorithm (SSA) (7), which 
includes the following reactions: 

1. binding of a ribosome to the start codon if the first 10 
codons are not occupied by ribosomes; 

2. binding of an aa-tRNA from the tRNA pool to the A site 
of an unoccupied ribosome; 

3. reactions of codon recognition, energy transformation, and 
peptide formation; 

4. releasing of the tRNA from the E site of a ribosome; 

5. translocation of the ribosome to the next codons if the 
safety condition is satisfied; 

6. dropping off of the ribosome once the stop codon is 
reached. 

Kinetics parameters are given by Table [T] which refer to |6). 
The tRNA pool compositions referred the total number of each 
tRNA in a yeast cell from |5] [6l and are given by Table [2] In 
simulations, to mimic the effects of available tRNAs for each 
single mRNA translation, we used a factor F (0 < F < 1) to all 
tRNA numbers to adjust the changes in the numbers of available 
tRNAs. For the anti-codon of each tRNA and the cognate, near¬ 
cognate, and non-cognate for each codon, refer |6| for details. 

It has been shown that the availability of free ribosomes is an 
important limitation for the translation efficiency 1201 . Here we 
introduced a parameter R for the maximum number of available 
ribosomes can be used for a single sequence translation. We 
note that a ribosome can be re-used after it was released from 
the stop codon. 

An example of translation kinetics obtained from our simu¬ 
lation is given at Fig. [2] which shows that the ribosomes se¬ 
quencing along the mRNA, and the number of protein produc¬ 
tion increases linearly with the translation time. The average 
translation rate (amino acids per second) in our simulation is of 
a order of 10, in good agreement with the experimental obser¬ 
vations 03- These suggest well defined translation efficiency, 
elongation time, and accuracy of a translation as given below. 

2.3. Translation efficiency, elongation time, and accuracy 

To quantify the translation process, we consider the transla¬ 
tion efficiency for the protein production rate, the elongation 


time for move kinetics of each individual ribosome, and the 
translation accuracy for the fidelity of translation. The trans¬ 
lation efficiency ( TE ) is defined as the average slope of the in¬ 
creasing of protein production number with the translation time. 
The elongation time of each ribosome is given by the time pe¬ 
riod from the binding of a ribosome to the start codon to its 
dropping off from a stop codon. The elongation time per codon 
(ETC), the average time for a ribosome to move one codon, 
is often used to describe the translation kinetics. The elonga¬ 
tion time is given by ETC x L/3, where L is the length (in 
nt) of a mRNA sequence. Since a protein product may contain 
mismatched amino acids due to the binding of near-cognate aa- 
tRNAs with the mRNA, it was possible to have incorrect protein 
products in the translation. Hence, the ratio of correct proteins 
in all protein produces gives the translation accuracy. 

3. Results 

3.1. Translation elongation time is log-normal distribution and 
sequence dependent 

The elongation time measures how long it takes a ribosome 
to finish the translation of a protein, which corresponds to the 
delay of translation in modeling the dynamics of gene regula¬ 
tion networks through delay differential equations l26l[27l . The 
production of proteins can be described by translation efficiency 
a and mRNA number M(t) through a delay differential equation 
of form 

dP f + ” 

— =a M(t-f)p{r)d t, (1) 

at Jo 

where r represents the elongation time, with distribution den¬ 
sity p(r). 

To obtain the formulation of the distribution density p(r), we 
calculated the elongation time per codon (ETC) in the transla¬ 
tion of YAF003W (here we note r = ETCxL/3). The distribu¬ 
tion density is showed at Fig [3] The density function was well 
fitted by log-normal distribution 

1 (lnx-jj) 2 

\i\N(p,a 2 )- -— e , x > 0. (2) 

xcr y2n 

Here the shape parameters p and cr are taken so that the log¬ 
arithm of ETC has mean p and variance cr 2 . Fet n - L/3 be 
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the number of amino acids in a protein product, the density 
function p(r) of elongation time is given from the log-normal 
distribution Eq. [2] as 


1 (]DT~\nn-ii) 2 



(3) 


while the competition of non-cognate tRNAs has only little af¬ 
fects to the elongation time. Moreover, Fig. [4] suggests typical 
parameters for the distribution of the ETC of yeast coding gene 
translation are p ~ -1.5 and cr 2 ~ 1.4 (refer to Eq. [2]). Our sim¬ 
ulations suggest no obvious dependence of ETC with sequence 
length L (data not shown). 


and the average elongation time is 

+OO 

Tp(j)dT = ne^' 4 ' 0 ^ 2 . (4) 

In the next section we show that the translation efficiency is 
dependent on the average elongation time, which refines our 
dynamical equation for protein production. 
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Figure 3: Distribution of the elongation time per codon in the translation of 
YAL003W. All parameters are the same at Fig. [ 2 ] Red curve is the fit with 
normal distribution hi ,V(-1 6.1.69). 



Each move of a ribosome consists of several chemical reac¬ 
tions shown by Fig. [T] including selections of cognate or near¬ 
cognate aa-tRNA from the tRNA pool, and a step forward if 
the safety condition is satisfied. To investigate how the ETC 
depends on the mRNA sequence and translation kinetics, we 
examined translations of a set of 1000 sequences from yeast 
coding genes with length L varies from 51 to 1995 nt (17 to 
665 codons). To measure the tRNA usage of each sequence, we 
calculated the average fraction of cognate, near-cognate, and 
non-cognate tRNA along the sequence, which are defined as 


F v 


. L /3 

1 yA «,, v 

L/3 ^ Total tRNA number’ 


v = cog, near, non. 


(5) 


here F v (v = cog, near, non) measures the average tRNA usage 
of cognate, near-cognate, and non-cognate tRNAs, respectively. 
The summation is taken over all codons, and n LV is the number 
of tRNAs of type v for codon i along the mRNA sequence. 

Fig. 0 shows the dependence of the mean (//) and variance 
(cr 2 ) of the logarithm of ETC with tRNA usages. Results sug¬ 
gest that the mean decreases with the cognate tRNA usage F cog , 
increases with the near-cognate tRNA usage F near , and has no 
correlation with the non-cognate tRNA usage F non , while the 
variance is not dependent on either F cog or F near , but weakly 
decreases with F non . These results suggest that the competition 
of near-cognate tRNAs tends to increase the elongation time. 



Figure 4: Dependence of the ETC of yeast coding sequences on tRNA usages. 
Dots show the mean (upper panel) and variance (bottom panel) of the loga¬ 
rithm of ETC with cognate tRNA usage E cog , near-cognate tRNA usage F near 
and non-cognate tRNA usage F non , respectively. Dashed lines show the lin¬ 
ear fitting. Simulations of 1000 yeast coding sequences are shown, each dot 
corresponds to one sequence. All parameters are the same as Fig. [ 2 ] 


The investigate how the available ribosomes number R af¬ 
fects the elongation time, we changed the value R to calculate 
the dependence of ETC. Results showed that both mean and 
variance of the logarithm of ETC are dependent on R nonlin- 
early: mostly independent to R when R is either small or large, 
and an obvious increasing dependence when R takes interme¬ 
diate values (Fig. A possible reason for the increasing of 
ETC is the traffic jam due to codon occupation. Fig. [5J3 shows 
that the average ribosome distance obviously decreases with R 
at the intermediate region, and approaches a minimum distance 
(the safe distance of 10 codons) when R is large. These results 
reveal that the increasing dependence of the elongation time 
with ribosome number R (10 < R < 30) is due to the increasing 
of traffics jam in translation kinetics. 



R R 


Figure 5: Dependence of the elongation time with the available ribosomes num¬ 
ber R. (a) Average ETC versus R. (b) Ribosome distance (in codons) versus R. 
Sequence and parameters are the same as Fig. [ 2 ] 


In the above calculations, the total number of tRNAs was 
fixed. To further examine how the number of total tRNAs af¬ 
fects the elongation time, we varied the factor F from 0.03 to 
1 to calculate the dependence of ETC. Results showed that 
both mean (p) and variance (cr 2 ) of the logarithm of ETC are 
decreasing with F for small F, and nearly unchanged when 
F > 0.5 (Fig. [6). Biologically these dependences are obvious 
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n — 1 


because it takes longer time to select a cognate or near-cognate 
when there are no enough tRNAs. 


Hence, let r 


be approximate to the elongation time 


c 

(here we note 1 /c corresponds to the average of ETC), the trans¬ 
lation efficiency satisfies 
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Figure 6: Dependence of the ETC with total tRNAs number represented by the 
factor F. The mean (left hand ordinate, blue circles connected with a dashed 
line) and variance (right hand ordinate, red triangles connected with a dotted 
line) of the logarithm of ETC are shown as a function of the factor F. Sequence 
and parameters are the same as Fig. [2] 


3.2. Translation efficiency is mainly dependent on the elonga¬ 
tion time and available ribosomes number 
in Gol ED, it has been shown that the varying of transla¬ 
tion efficiency were caused by translation initiation, and the 
availability of free ribosomes was a typical rate limit of trans¬ 
lation. To investigate how the translation efficiency depends on 
the translation kinetics and mRNA sequences, we constructed a 
model to track the dynamics of available ribosomes. 

Consider a mRNA with n codons (n = L/3). Let R be the 
number of available ribosomes, x,(t) (i = 1, • • • , n) the number 
of ribosomes at the / th codon at time t, and xo(t) the number 
of free ribosomes. Kinetics of a ribosome in translation is a 
combination of initiation in a rate K, elongation per codon in 
a rate c and termination in a rate Kj. Therefore, the dynamics 
of xi can be expressed by the following differential equations 
model 


TE oc 


RK 

Kf + 1 + K/K t ' 


( 10 ) 


When R is large so that all codons are occupied, the transla¬ 
tion efficiency is mainly determined by the elongation time so 
that TE oc 1/f. Hence, taking account Eq. [TO] the translation 
efficiency can be approximated as 


K min{R,R max } 
Kf + 1 + K/K t ’ 


( 11 ) 


where R max is the number of available ribosomes to saturate 
all codons. We take R max = n /10 in our simulations, which 
is consistent with Fig. [5] We note that f is dependent on R 


suggests the following dependence of translation efficiency on 
the ribosome number R: linearly increases when R is small, 
independent to R when R is large, and nonlinear dependence 
through the elongation time f when R takes intermediate values. 
These results are in agree with our numerical simulations (Fig. 

El- 


according to the above discussions, hence the relation Eq. 11 



" dxo 

—— = KjX n - Kxo 

at 
ax i 

—— = Kxq - cx i 

i’ Xl 

— = c(xt- 1 - Xi) I - 2,3, •••,«- 1 

eft 

Li 

— = cx„^i - K t x„. 

at 

The protein production rate is proportional to x n . Here we note 

0 < xo < R, 0 < x, < 1 (i = 1,2, ■ ■ ■ , n), (7) 

and 

n 

T J x i =R. (8) 

!=o 

When Kj > c and R is small, Eq. [6]has a stable equilibrium 
state which gives 


R 


Xn = 


n — 1 Kj 

K t -+ 1 + — 

c K 


(9) 


Figure 7: Dependence of translation efficiency with the maximum number of 
available ribosomes R. Dashed lines show the two-phase dependence following 
Eq. 1 1 1 1 Sequence and parameters are the same as Fig. [2] 


The result Eq. 11 supports the previous findings that trans¬ 
lation initiation and ribosome number are rate limits of protein 
production. Moreover, the translation efficiency decreases with 
the elongation time, which shows the dependence of protein 
production with the mRNA sequence through the elongation 
dynamics. 

Since the average elongation time f is proportional to the pro¬ 
tein length n, Eq. El] suggests that the translation efficiency 
depends on the protein length n through a Michaelis-Menten 
function. Fig. [8jr shows translation efficiency versus sequence 
length for yeast coding sequences with different length. The 
translation efficiency is well fitted by a Michaelis-Menten func¬ 
tion, in agreement with our theoretical conclusion Eq. ED 
To further investigate the sensitivity of translation efficiency 
with the changes in parameters, we increased or decreased each 
of the parameters in Table [T] and examined the changes in the 
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Figure 8: Translation kinetics, (a) Translation efficiency versus sequence 
length for 1000 yeast coding sequences. Red line shows the fitting with 

TE = -. (b) Translation accuracy versus sequence length for 1000 

1 + 0.0033n 

yeast coding genes. Red line shows the fitting with e -° 0042 ". Here n = L/3 is 
the protein chain length. Data obtained from the simulation at Fig. [4] 

translation efficiency. The results showed that the translation 
efficiency is sensitive with the changes in k02 (or ke02 for near¬ 
cognate tRNA), kOl, kl, k2, which correspond to the process of 
aa-tRNA selection. The translation initiation K is also impor¬ 
tant for the translation efficiency, as we have seen from Eq. 11 
Changes in other parameters led to minor changes in the transla¬ 
tion efficiency. These results indicate that the steps of aa-tRNA 
selection are crucial for the translation efficiency through their 
effects to the elongation time, and changes in the steps of pep¬ 
tide formation have minor effects to the translation efficiency. 
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t; 

'uj o 
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Figure 9: Sensitivity analysis of the translation efficiency. Bars show changes 
in the logarithm of translation efficiencies induced by changes in a single pa¬ 
rameter ln(F E*/TEo), where TE* and TEq represents the TE for modified and 
default parameters, respectively. Blue bars correspond to increasing of a pa¬ 
rameter by 10%, and yellow bars correspond to decreasing of a parameter by 
10%. For parameters refer to Table]!] the parameters ke02, ke3, ke5, ke7, keT 
for values of k02, k3, k5, k7, kT of near-cognate tRNAs (second column in 
Table |TJ. respectively, and knOl for the parameter kOl for non-cognate tRNAs 
(third column in Table[I] Sequence and defaults parameters are same as Fig. [2] 
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3.3. Translation accuracy decreases exponentially with se¬ 
quence lengths 

In translations, protein products may contain mismatched 
amino acids when a near-cognate aa-tRNA is selected and suc¬ 
cessfully form a peptide. Hence the translation accuracy, frac¬ 
tion of correct protein products, should be exponentially de¬ 
cay with the chain length, the decay rate is associated with the 
probability of selecting a near-cognate aa-tRNA at each step. 
Fig. [8p shows the translation accuracy versus sequence length, 
which is well fitted with an exponential function. 

In living cells, abnormal proteins are usually degraded 
quickly so that the intracellular amino acids can be recycled ef¬ 
ficiently. Hence, only correctly translated proteins are relevant 
in modeling the dynamics of gene regulation networks. This 


yields a factor by translation accuracy in the production rate of 
normal proteins in the equation Eq. |T] 

The above discussions suggest a more refined equation for ef¬ 
fective protein production Eq. [Tjwith p(r) given by log-normal 
distribution Eq. [3j and the effective translation efficiency a is 
given by 


a 


ac 


—cn 


1 + bnef +a ' 2 l 2 


( 12 ) 


where parameters a, b depend on available ribosomes number, 
translation initiation and termination, and c relates with the 
composition of the tRNA pool. A crucial refinement of Eq. 


12 is the dependence of protein chain length n, and other pa¬ 


rameters are somehow universal for differential proteins, except 
weak dependence of p and <x 2 with sequences as shown by Fig. 
[4] under certain cellular conditions. 


3.4. Translation kinetics with sequence dependences 

A motivation of this study was trying to examine whether 
there are distinct dynamics for coding and non-coding RNA se¬ 
quences in the translation. We have shown that the translation 
efficiency depends on the mRNA sequence through the elonga¬ 
tion time, and the mean and variance of the elongation time per 
codon are dependent on the sequence through the aa-tRNA us¬ 
ages. A study of ribosome occupancy showed that many large 
noncoding RNAs are bound by ribosomes and hence are pos¬ 
sible to be translated into proteins ifTOl fill . To investigate the 
translation kinetics of coding and noncoding RNAs, we applied 
the model simulation to yeast coding RNA, yeast noncoding 
RNA, human coding RNA and human noncoding RNA. In each 
sample, 500 sequences with lengths between 200nt and 1000 nt 
were selected, however most of the noncoding RNA has reading 
frames with lengths less than 300 nt, in agreement with the ob¬ 
servations in 0. Simulations showed that the previous results 
are qualitatively held for different samples. Fig. [10] shows the 
distributions of mean ip) and variance (cr 2 ) of the logarithm of 
the ETC for each set of the simulations (The average of p and 
cr 2 for each ample are given by the table), which are crucial pa¬ 
rameters in the density function Eq. [2] From Fig. 10 we have 
the following observations: coding RNAs (both yeast and hu¬ 
man) have similar distribution in p and cr 2 ; noncoding RNAs 
have smaller variance cr 2 comparing with the corresponding 
coding RNAs. These results reveal distinct translation kinet¬ 
ics statistically between coding and noncoding RNAs, waiting 
biological significance of the findings to be discovered. 


4. Discussions 

We have applied stochastic simulation to study the transla¬ 
tion kinetics at molecular level. In the model, RNA sequences, 
ribosome dynamics, tRNA pool and biochemical reactions in 
the elongation steps were included. Simulations showed that 
during translations the ETC satisfied log-normal distribution 
(Fig. PTl, and is mainly determined by both codon saturation 
(Fig. [5^ and the steps of aa-tRNA selection (Fig. |9|. In the 
tRNA selection, the relative numbers of near-cognate to cog¬ 
nate aa-tRNAs are crucial for the elongation of a ribosome, and 
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(a) 


(b) 




(c) (d) 





Mean (p) 

Variance (cr 2 ) 

Yeast coding 

-1.4895 

1.4256 

Yeast non-coding 

-1.6410 

1.1948 

Human coding 

-1.3773 

1.5981 

Human non-coding 

-1.3295 

1.4885 


Figure 10: ETC of the translations for different samples. Distributions of Mean 
and variance of the logarithm of ETC for yeast coding RNAs (a), yeast noncod¬ 
ing RNAs (b), human coding RNAs (c) and human noncoding RNAs (d). Here 
the results of 500 random sequences with length 200/:/ < L < 1000/// for each 
sample are shown. Red stars show the average values for each sample, values 
are given by the table. Parameters are R = 20, F = 0.03, and other parameters 
are referred to Table[T] 


hence the mean value of the logarithm of ETC is dependent on 
the tRNA usages defined by F cog and F near in this paper (Fig. 
[4}. In the log-normal distribution Eq. [2] the mean p and vari¬ 
ance cr 2 are important for the density function of the elongation 
time. We showed that these two parameters are slightly differ¬ 
ent for coding and noncoding RNAs for both yeast and human 
samples (Fig. [T0| ). On average, noncoding RNAs have smaller 
variance than coding RNAs. A simple model of ribosome dy¬ 
namics revealed that the translation efficiency is mainly deter¬ 
mined by the number of available ribosomes, translation initia¬ 
tion and the elongation time, and the translation efficiency de¬ 
pends on the elongation time through a Michaelis-Menten func¬ 
tion. The translation efficiency increases with the available ri¬ 
bosomes number when the number is small, however is insensi¬ 
tive with the ribosome number when the number is large enough 
to saturate all codons. These results are further confirmed by 
our simulations. Moreover, the translation accuracy decreases 
exponentially with sequence lengths. These results suggest an 
improvement for the effective protein production when we are 
modeling gene expressions in gene regulation networks. 

In modeling gene expressions the protein production is de¬ 
scribed by a delay differential equation of form Eq. |T] that de¬ 
pends on the translation efficiency a and the distribution p(r) of 
elongation time. This study showed that the effective produc¬ 
tion of correct proteins can be expressed as 


ae 


a 


] + bne IJ+a ' 2 / 2 ’ 


(13) 


here n is the protein chain length (number of amino acids), p 
and cr 2 are mean and variance of the logarithm of the ETC , 
respectively, parameters a and b are dependent on available ri¬ 
bosomes number, translation initiation and termination, and c 
relates with the composition of the tRNA pool. The distribution 
of the elongation time is formulated as 


1 flnr-ln.. :r 

p(r) = -— e *«■- . (14) 

tct y2n 

Hence, the protein production equation Eq. [T]can be rewritten 
as 


dP ae 
dt 1 + bne^ +a " 


- f 

3 ' 2 Jo 


M(t-f)- 


- V27T 


(lnT-lnn-p) z 

e 2.2 dT, (15) 


where M(t) is the number of mRNAs at time t. In this equation, 
the protein chain length n is explicitly included. Moreover, the 
sequence information are implicitly included in the parameter 
r and cr 2 which are mainly determined by the process of aa- 
tRNA selection in each step of ribosome movement. Yet other 
parameters are somehow universal under given cellular condi¬ 
tions. A direct conclusion from Eq. 15 is the extreme low 


effective production rates of long proteins, which is because 
of the long elongation time and low translation accuracy for 
these long chain molecules. This is in consistent with biological 
observations that many transcription factors are small proteins 
with high production rates (many of them have high degrada¬ 
tion rates as well) m , and many structural proteins (e.g. fibers) 
and transport proteins (e.g. membrane proteins) are large pro¬ 
teins with low production rates (these proteins are mostly very 
stable) l25l . Hence, this study provides insightful details for the 
known observations, and is valuable in further works of whole 
cell modeling. 


Data Resources 

RNA sequences are downloaded from available databases: 

• Yeast coding RNAs from SGD 
(http://downloads.yeastgenome.org/sequence/S288C_reference/ 
orf_dna/orf_coding.fasta.gz). 

• Yeast noncoding RNAs from SGD 

(http://downloads.yeastgenome.org/sequence/S288C_reference/ 
ma/ma_coding.fasta.gz). 

• Human coding RNAs from Ensembl Genome Browser 
(http://useast.ensembl.org/biomart/martview/8a921aclac4642b 
07708af32f2339655). 

• Human noncoding RNAs from Genecodel9 
(ftp://ftp.sanger.ac.uk/pub/gencode/Gencode_human/release_19 
/gencode.vl9.1ncRNA_transcripts.fa.gz). 
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